This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

support RLE and binary mask #150

Closed · wants to merge 9 commits

Conversation

@wangg12 (Contributor) commented Nov 13, 2018

No description provided.

@facebook-github-bot

Thank you for your pull request and welcome to our community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. In order for us to review and merge your code, please sign up at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need the corporate CLA signed.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

@fmassa (Contributor) left a comment

This is starting to look pretty good, thanks!

I have a few comments that I think would be worth discussing / addressing.

4 review comments on maskrcnn_benchmark/structures/segmentation_mask.py (outdated, resolved)
@fmassa (Contributor) commented Nov 13, 2018

Also, I think it would be awesome to add some tests in maskrcnn-benchmark/tests, which we are currently lacking; that would make reviewing the changes much easier!

@wangg12 (Contributor, Author) commented Nov 13, 2018

@fmassa I think it should be consistent with Detectron now.

@facebook-github-bot added the CLA Signed label on Nov 14, 2018
@facebook-github-bot

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Facebook open source project. Thanks!

@fmassa (Contributor) commented Nov 14, 2018

I'll have another closer look today, but would you mind adding some tests in tests/? It will make things much easier!

@wangg12 (Contributor, Author) commented Nov 14, 2018

@fmassa I'm not familiar with how to write the tests.

@fmassa (Contributor) commented Nov 14, 2018

Ok, no worries

@fmassa (Contributor) left a comment

I added a few more comments, but it's not a full review yet.

2 review comments on maskrcnn_benchmark/structures/segmentation_mask.py (outdated, resolved)
@wangg12 (Contributor, Author) commented Nov 14, 2018

@fmassa I've added some tests for segmentation_mask. However, transpose and resize cannot pass the tests. Some help is needed.

@fmassa (Contributor) left a comment

I don't think we need all those files for this test.

Also, can you tell me what the difference is between the two implementations (the numerical difference), so that I understand the problem a bit better?

Review comments on tests/common_utils.py and tests/test_segmentation_mask.py (outdated, resolved)
@wangg12 (Contributor, Author) commented Nov 14, 2018

For resize I can understand that there may be some numerical difference because of the interpolation, but I don't know how to make them equivalent. For transpose the difference is much stranger; I don't know where it comes from.

@fmassa (Contributor) commented Nov 14, 2018

For the resize, I'd check whether the difference appears at the boundaries. Also, the values should be in 0-1.
For the transpose, I'd see if the difference disappears if you set this TO_REMOVE = 0. That could explain the difference.

Also, visualizing both results in the image space is going to be very helpful.
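
For reference, a minimal sketch of that kind of visualization (show_diff and its arguments are hypothetical helpers, assuming both results are already available as H x W binary arrays):

import numpy as np
import matplotlib.pyplot as plt

def show_diff(mask_a, mask_b, title="polygon vs. binary mask"):
    # Signed difference: +1 where only mask_a is set, -1 where only mask_b is set.
    diff = np.asarray(mask_a, dtype=np.int32) - np.asarray(mask_b, dtype=np.int32)
    plt.imshow(diff, cmap="bwr", vmin=-1, vmax=1)
    plt.title("{}: {} differing pixels".format(title, np.count_nonzero(diff)))
    plt.colorbar()
    plt.show()

If the differing pixels form thin lines along the object contours, the discrepancy is a boundary effect rather than a systematic offset.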

@wangg12 (Contributor, Author) commented Nov 14, 2018

@fmassa For transpose, I changed TO_REMOVE = 0 in the Polygons class. The difference is smaller but still exists. For resize, I've visualized the result; the difference comes almost entirely from the boundaries.

@fmassa (Contributor) commented Nov 14, 2018

Ok, this is good progress, thanks!

I'd need to look more closely to see where the difference might come from for the transpose; there might be a few pixels off, which can also be observed by viewing the images.

For resize, I'd need to check a bit more carefully to see if I spot anything, but I might not have the time today or tomorrow.

@JoyHuYY1412

@fmassa For transpose, I changed TO_REMOVE=0 in class Polygons. The difference is smaller but still exists. For resize, I've visualized the result, the difference almost comes from the boundaries.

If I use the segmentation_mask.py you wrote, are there other changes I should apply to other files?
It looks like the functions and interfaces fit the original code well.
Also, is there anything else I should pay attention to? Maybe I can try the transforms without transposing first? Thank you!

@fmassa (Contributor) commented Jan 22, 2019

@JoyHuYY1412 you'd need to check the interpolation (to use bilinear instead of nearest), apart from that, the rest should be unchanged (or almost).

@txytju do you mind finishing this PR, given that you managed to make it work for your case?

@JoyHuYY1412

@JoyHuYY1412 you'd need to check the interpolation (to use bilinear instead of nearest), apart from that, the rest should be unchanged (or almost).
Thank you~

@IssamLaradji

I like this!

@IssamLaradji commented Jan 25, 2019

I faced a problem with cropped_mask = self.mask[box[1]: box[3], box[0]: box[2]] in the code below: box[0] and box[2] were equal, resulting in width = 0 and causing an error.

def crop(self, box):
    box = [int(b) for b in box]
    w, h = box[2] - box[0], box[3] - box[1]
    w = max(w, 1)
    h = max(h, 1)
    # Fails when box[0] == box[2]: the slice below yields a zero-width mask
    # even though w has been clamped to 1.
    cropped_mask = self.mask[box[1]: box[3], box[0]: box[2]]
    return Mask(cropped_mask, size=(w, h), mode=self.mode)

This happened because I called cropped_mask = segmentation_mask.crop(proposal), where proposal is tensor([610.0664, 258.8555, 610.7168, 269.9121]), which int() truncated to tensor([610, 258, 610, 269]).

@wangg12 (Contributor, Author) commented Jan 26, 2019

@IssamLaradji I think it should be w, h = box[2] - box[0] + 1, box[3] - box[1] + 1.

And I guess round is more suitable than int?
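
A minimal, illustrative way to combine the suggestions in this thread (rounding instead of truncating, clamping to non-negative coordinates, and guaranteeing a non-empty crop); safe_crop is a hypothetical helper, not the PR's code, and it sidesteps the +1 inclusive-endpoint question:

import torch

def safe_crop(mask, box):
    # Round (rather than truncate) the box coordinates and clamp them to >= 0.
    x1, y1, x2, y2 = (max(int(round(float(b))), 0) for b in box)
    # Guarantee at least a 1x1 crop, even for degenerate boxes like the one above.
    w, h = max(x2 - x1, 1), max(y2 - y1, 1)
    return mask[y1:y1 + h, x1:x1 + w]

mask = torch.zeros(480, 640)
print(safe_crop(mask, (610.0664, 258.8555, 610.7168, 269.9121)).shape)  # torch.Size([11, 1])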

@fmassa (Contributor) commented Feb 18, 2019

@botcs if you could write unit tests for this PR, it would be awesome!

@botcs (Contributor) commented Feb 18, 2019

Using the available test_segmentation_mask, I have tried to visualize the Polygon and the Mask tests; here is a notebook with my observations:
https://gist.github.com/botcs/95176d877dcd26e48e46cceecdac5763

@fmassa (Contributor) commented Feb 19, 2019

@botcs that's awesome, thanks for the notebook!
So indeed we have the boundary effects as before.

One last thing I'd like to do to verify whether these boundary effects actually matter is to run a training with https://github.com/facebookresearch/maskrcnn-benchmark/blob/master/configs/quick_schedules/e2e_mask_rcnn_R_50_FPN_quick.yaml , which is a fast training that should take ~10 min to run, using the Mask class instead of Polygons.

For that, I'd (locally, this is not to be committed) modify maskrcnn_benchmark/data/datasets/coco.py, lines 82 to 84 (at f8b0118):

masks = [obj["segmentation"] for obj in anno]
masks = SegmentationMask(masks, img.size)
target.add_field("masks", masks)

so that it uses Mask instead.

I'd expect to have as results something like

EXPECTED_RESULTS: [['coco_2014_minival', 'box', 'AP', [0.082300, 0.001682]], ['coco_2014_minival', 'mask', 'AP', [0.075039, 0.001872]]]

so box AP should be around 8.2, and mask AP around 7.5.

Could you do that?

Thanks!

s += "image_height={}, ".format(self.size[1])
s += "mode={})".format(self.mode)
return s



class Polygons(object):
@botcs (Contributor) commented on the diff above, Feb 19, 2019

I think the mode field for the Polygon is completely irrelevant.
It is never used, but causes:

  • additional argument passing when constructing
  • a headache when trying to find out what Polygon really is

Question:
Wouldn't it be more consistent if the convert method of a Polygon were renamed to convert_to_mask and returned a Mask instance?

@fmassa (Contributor) replied:

Hi,

I agree with your points.

The reason why this is currently the case is that I wanted to keep the same interface between Polygons and Box (which is not implemented, but would be the single-box equivalent of BoxList).
And my original idea was that we would be able to specify what was the underlying type of the data via the mode: is it a polygon, or a mask?

I'm not sure about changing the convert name of the method though.

In general, I think both box_list and segmentation_mask could benefit from some better design / cleanup, but I'm not sure what that would be

@botcs (Contributor) replied:

And my original idea was that we would be able to specify what was the underlying type of the data via the mode: is it a polygon, or a mask?

I think I cannot follow this part:

To specify what was the underlying type of the data via the mode

As in the current implementation, a Polygon instance:

  1. can be initialized with a list of polygons
  2. can be initialized with a Polygon instance (which is currently referenced, but should be deep-copied IMO)
  3. cannot be initialized with a Mask, a feature that could be added if necessary (I am doing this to convert GTA binary masks to the COCO Polygon format, but only because binary masks are not supported).

So the underlying data would be specified: Polygon.

On the other hand, about the convert function:

I'm not sure about changing the convert name of the method though.

  1. The convert function takes an argument for the target mode, but it actually accepts only a single value, which is odd.
  2. If I assume that a Polygon can only be convert-ed to a Mask, then the convert name is OK, but it relies on the assumption that the data can be represented either as Polygons or as Masks and nothing else, which is not necessarily a trivial assumption, so changing the name to convert_to_mask would be clear from the very first encounter.

keep the same interface

  1. In that case we should also add a convert or convert_to_polygon method to the Mask class (see the sketch below).
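
Concretely, the symmetric interface being discussed would look roughly like this (method names are only the ones floated in this thread; the refactor that was eventually merged in #473 has its own design):

class Polygons(object):
    def convert_to_mask(self):
        # Rasterize the polygons into a binary Mask of the same image size.
        ...

class Mask(object):
    def convert_to_polygon(self):
        # Trace the binary mask back into a polygon representation.
        ...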

@fmassa (Contributor) replied:

Those are all reasonable points, and I'm willing to accept PRs that improve the overall consistency and software design of the codebase

@botcs (Contributor) replied:

@fmassa Thanks, these points were considered for the refactored version, PR #473

@botcs (Contributor) commented Feb 19, 2019

(Quoting @fmassa's comment above about running the e2e_mask_rcnn_R_50_FPN_quick.yaml quick schedule with the Mask class instead of Polygons, modifying coco.py locally, and comparing against the expected box/mask AP.)

I have trained the model, which went fine, but the evaluation failed with the following error:

  File "/home/csbotos/anaconda3/envs/debugmask/lib/python3.7/site-packages/maskrcnn_benchmark-0.1-py3.7-linux-x86_64.egg/maskrcnn_benchmark/structures/segmentation_mask.py", line 206, in __init__
    if not isinstance(segms[0], (list, Polygons)):
IndexError: list index out of range

The whole output can be found at this gist.

@botcs (Contributor) commented Feb 19, 2019

I have suppressed the error by using a single empty instance with an all-zero mask when the provided segms is an empty list. The results are the following: box AP is 6.0 and mask AP is 5.9.

That is quite a bit below the expected performance, and I am now rerunning the training to see if it is still the case.
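
Roughly, the workaround amounts to a guard like the following before the segms[0] access shown in the traceback (a sketch of the idea only, not the actual patch; whether the constructor accepts a raw tensor depends on the Mask/SegmentationMask API in this PR):

import torch

def ensure_nonempty(segms, size):
    # size is (width, height), as used elsewhere in the codebase.
    width, height = size
    if len(segms) == 0:
        # Fall back to a single all-zero mask so downstream code still receives
        # a well-formed (empty) instance instead of raising IndexError.
        segms = [torch.zeros((height, width), dtype=torch.uint8)]
    return segms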

@botcs (Contributor) commented Feb 19, 2019

I was curious whether the expected results in @fmassa's comment were correct, so I ran the training using both Polygons and Mask, two runs each, and got the following results:

  • Polygon: box AP 4.4, mask AP 4.3
  • Mask: box AP 6.0, mask AP 5.9

So the Mask is doing better than the Polygon, while the expected results for the training script were higher: box 8.2, mask 7.5.
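
For comparison with the quick-schedule expectations quoted earlier, a rough check, assuming the Detectron-style convention that each EXPECTED_RESULTS entry is [dataset, task, metric, [mean, std]] and that a run should land within a few standard deviations of the mean (the tolerance and helper name below are illustrative):

EXPECTED_RESULTS = [
    ['coco_2014_minival', 'box', 'AP', [0.082300, 0.001682]],
    ['coco_2014_minival', 'mask', 'AP', [0.075039, 0.001872]],
]

def within_expectation(actual, mean, std, sigma_tol=4.0):
    return abs(actual - mean) <= sigma_tol * std

# The Mask run above reached box AP 6.0, i.e. 0.060:
dataset, task, metric, (mean, std) = EXPECTED_RESULTS[0]
print(within_expectation(0.060, mean, std))  # False: well below the expected ~8.2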

@JoyHuYY1412

@IssamLaradji I think it should be w, h = box[2] - box[0] + 1, box[3] - box[1] + 1.

And I guess round is more suitable than int?

In the code, should it be round(float(b))?

@botcs (Contributor) commented Feb 21, 2019

Hi guys,

A few days ago @fmassa mentioned in one of his comments the following:

In general, I think both box_list and segmentation_mask could benefit from some better design / cleanup, but I'm not sure what that would be

So I tried to reshape things a bit to better accommodate the different requirements, but it radically changed a few concepts. I would be keen to hear your opinions on it: #473

@IssamLaradji

I am getting this error with this code:

  File "/maskrcnn-benchmark/maskrcnn_benchmark/modeling/roi_heads/mask_head/loss.py", line 39, in project_masks_on_boxes 
    scaled_mask = cropped_mask.resize((M, M)) 
  File "/mnt/home/issam/Research_Ground/domain_adaptation/ann_utils.py", line 282, in resize 
    self.mask[None, None, :, :], (height, width), mode="bilinear" 
  File "/miniconda/envs/py36/lib/python3.6/site-packages/torch/nn/functional.py", line 2447, in interpolate 
    return torch._C._nn.upsample_bilinear2d(input, _output_size(2), align_corners) 
RuntimeError: invalid argument 2: input and output sizes should be greater than 0, but got input (H: 2, W: 0) output (H: 28, W: 28) at /pytorch/aten/src/THNN/generic/SpatialUpSamplingBilinear.c:19 
Uncaught exception. Entering post mortem debugging 
Running 'cont' or 'step' will restart the program 

self.mask = tensor([], size=(2, 0))

@fmassa (Contributor) commented Feb 28, 2019

@IssamLaradji does this also happen with #473 ?

@jefequien commented Apr 2, 2019

@IssamLaradji I think it should be box = [max(round(float(b)), 0) for b in box]. The bad input size comes from a crop when a box coordinate somehow becomes -1.

@wangg12 closed this on Apr 9, 2019
@fmassa (Contributor) commented Apr 9, 2019

Thanks for the initial work on this PR @wangg12 ! This has been merged in #473

@IssamLaradji commented Apr 9, 2019

Does this work yet for RLE? I am getting this when I pass a list of RLEs.

File "/maskrcnn-benchmark/maskrcnn_benchmark/structures/segmentation_mask.py", line 73, in __init__
    if len(masks.shape) == 2:
AttributeError: 'list' object has no attribute 'shape'
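
One possible workaround until RLE is handled natively is to decode the RLEs into per-instance binary masks first (a sketch assuming compressed COCO RLE dicts; whether the decoded arrays can be passed straight to SegmentationMask depends on the API merged in #473):

import pycocotools.mask as mask_utils

def rles_to_binary_masks(rles):
    # decode() returns an H x W x N uint8 array for a list of compressed RLE dicts;
    # uncompressed RLEs (with "counts" given as a list) need mask_utils.frPyObjects first.
    decoded = mask_utils.decode(rles)
    if decoded.ndim == 2:  # a single RLE dict decodes to H x W
        decoded = decoded[:, :, None]
    return [decoded[:, :, i] for i in range(decoded.shape[2])]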

@ShihuaiXu

AttributeError: 'list' object has no attribute 'shape'
I met the same problem!

@botcs (Contributor) commented Jul 6, 2019

Hi @ShihuaiXu ,
Please visit the updates here
